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. We study the size of connected components of random nearest- 

neighbor graphs with vertex set the points of a homogeneous Poisson 
point process in R d . The connectivity function is shown to decay 
superexponentially, and we identify the exact exponent. From this 



Qi^ ' we also obtain the decay rate of the maximal number of points of 



•3 



a path through the origin. We define the generation number of a 
point in a component and establish its asymptotic distribution as 
the dimension d tends to infinity. 

1. Basic definitions and results. Let X be a homogeneous density 1 Pois- 
son process in M d with an "extra" Poisson point located at the origin. Let 
< ^ • Qd denote the directed graph whose vertices are the Poisson points and in 

which there is a directed edge from s £ X to s' G X if s' is the nearest neigh- 
\Q • bor (NN) of s. Ignoring the directions of the edges leads to an undirected 

. graph which we denote by just Qd- 

| The graph Q$ was introduced and studied in [1]. They showed that Qd 

. contains a.s. no unbounded component, in any dimension d. In the current 

*^ paper, we are interested in the tail behavior of components in Qd] rather 

than only stating that they are finite, we would like to know how large these 
clusters typically are. To this end, we make a few definitions. 

We denote by pd(n,L) the probability that there is a directed path in Qd 
starting at the origin, touching exactly n distinct points (besides the origin) 
and ending at a point s with \s\ > L (where | • | denotes Euclidean distance). 
■ Furthermore, t^(L) is the probability that there is a path in Qd starting at 

the origin and ending at a point s with |s| > L. 

Since in a Poisson process the distances between any two pairs of points 
are different a.s., each point has an a.s. unique NN. It is quite possible that 
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two Poisson points are each other's NN. In this case there is a mini-loop of 
two directed edges in between these two NN's. In fact, since each Poisson 
point must have an NN, and all components are finite, each component 
contains at least one such closed mini-loop. This would happen when a 
Poisson point s is an NN of one or more other Poisson points, and when one 
of the latter is an NN of s as well. Furthermore, each component contains 
exactly one closed mini-loop, as the existence of more than one such closed 
mini-loop would imply the existence of more than one NN for some Poisson 
point in the component. 

It is not possible to have any type of closed circuit other than the one 
described here, as the existence of any closed circuit involving more than 
two Poisson points would contradict the fact that the lengths of successive 
directed edges are decreasing. The existence of an (a.s.) unique NN implies 
that along any path in Qd there is at most one change of direction (in the 
arrows of Qd), arid the change of direction must take place at one of the two 
Poisson points in a loop. 

We finally mention that it is a standard fact that there is a uniform and 
finite upper bound, depending on the dimension d, for the number of points 
that have the same point as their nearest neighbor. The maximum number 
of such points is called the kissing number and denoted by Kd < co; see, for 
instance, [3]. 

There are at least two ways to measure the size of a component in Qd'- 
one can look at the diameter of the component, or at the number of points 
in the component. First we state a result that tells us that the diameter 
of a component decays faster than exponentially, and which specifies the 
exponent exactly. For a related result in a discrete setting, see [2]. 

Theorem 1.1. There exist constants C\, Ci, Lq G (0, co) (depending 
only on the dimension d) such that 

(1) e -<hL<yxL)W < Td{L) < e -c 2 i(iogi)C«-i>/« fQr L > Lq _ 

Turning to the number of points in a component, we let pd(n) denote the 
probability that there is a path in Qd through the origin touching more than 
n distinct points. 

Theorem 1.2. There exist constants C[, C 2 , Nq £ (0, oo) (depending 
only on d) such that 

e -C>logn < < e -G>logn ^ n > ^ 

For fixed dimension, it seems difficult to make more precise statements 
about the number of points in a component. Therefore, we investigate what 
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happens in the limit when the dimension d — > oo, and we will obtain some 
indirect information about the size of a component via the so-called genera- 
tion number of a point. We already remarked that any component contains 
exactly one mini- loop of two arrows. The points in this mini-loop are given 
generation number 1. A Poisson point x receives generation number k if 
the graph distance to the unique mini-loop in the component of x is equal 
to k — 1. We denote by gd{k) the probability that the origin has genera- 
tion number k, in dimension d. The following result is an indication that 
components typically are very small. 

Theorem 1.3. We have that 

&* ( * )= (*Tl)!' k = l,2, 

Section 2 contains the proofs of Theorems 1.1 and 1.2, while Section 3 
contains the proof of Theorem 1.3. 

2. Proofs of Theorems 1.1 and 1.2. In this section W\, . . . , W n are i.i.d. 

Revalued random variables whose common probability density is given by 
e -V(B(o,\w\)) _ e ~n d \w\ _ rj,^ i ower bc-vmcl is based on the following. Note 
that Td(L) >Pd(n,L) for every n. 

Proposition 2.1. There exist constants b\,c\ € (0, oo) (depending only 
on d) such that 

(2) Pd (n,L)>^fe~^ L ^ d . 

As will be seen from the proof of the proposition, one may choose any 
9 £ (0, j] and take b\ to be the probability that a uniformly distributed 
random point on the unit sphere S d ~ l falls into the "polar cap" of open- 
ing half- angle 9. The corresponding c\ may then be taken as where 
ltd = V(B(0, 1)), where V denotes Euclidean volume and B(x,r) denotes the 
(open) ball of radius r centered at x € M. d . 

For the upper bound we need to bound not only pd(n,L) but also some 
closely related quantities that we now define. 

For j G {0, . . . ,n} we define pd(n,L,j) to be the probability of the event 
E(n,L,j), that there are two directed paths in Q&\ one from to some s' , 
touching exactly j particles (besides 0) and one from some s to the same 
s', touching exactly n — j particles (besides s') and such that |s| > L. Thus 
Pd(n,L) = pd(n,L,n) and furthermore (by the properties of the directed 
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graph Qd mentioned above and) since t^{L) is equal to the probability of 
Un=iU?=i£(n,£,i),wehave 

oo n 

(3) r d (L)<£5> d (n,L,j). 

n=lj=0 

The upper bound is based on the following: 

Proposition 2.2. There exists a constant C2 £ (0, oo) (depending only 
on d) such that 

Pd(n, L,j) < ) Kd > P(\Wi + --- + W n \> c 2 L) 
j\{n-j)\ 

( 4 

^ T, + --- + \Wn\> C 2 L), 

j\{n-j)\ 

where W\, . . . , W n are i.i.d. W 1 -valued random variables as described earlier. 

As a corollary to Proposition 2.2 and inequality (3), we have the following. 

Proposition 2.3. r d (L) < e 2K P{U > c 2 L), where U is a random vari- 
able withE(e rU ) = e 2KE ^ lWl1 ^. 

Proof of Proposition 2.1. Given (xi, . . . ,x n ) G (R d ) n we define s = 
and si = x\ + ■ ■ ■ + Xi for i = 1, . . . , n. Let the set of points (xi, . . . , x n ) 6 
) n satisfying the following three conditions be denoted by S: 



(i) |xi| > |x 2 | > ■ ■ > |x n |, 

(ii) \s n \ > L, 

(hi) Si U}=i Bj fori = 1, ... ,n, where I?j = B(sj-i, \xj\) is the open ball 
centered at s^_i of radius \xj\. 

Note that [because of (i)] condition (hi) may be replaced by: 

(hi') Bi n {s , ...,s n } = s/_i for I = 1, . . . , n. 
We now claim that 

(5) pa(n,L)= / e vu j=i j ' dx\ ■ ■ ■ dx n . 



s 

Since we use this type of equality (a variation on Campbell's theorem) 
a number of times, and since we have not been able to hnd a proof in the 
literature, we spend a few lines on the proof of (5). Our proof proceeds 
by a suitable discretization of For A; = 1,2,..., we consider (nested) 
subdivisions of M. d into d-dimensional cubes of side length 2~ k . We denote 



CONTINUUM NEAREST-NEIGHBOR GRAPHS 



5 



by T>{S) the collection of re-tuples (D k ,...,D k ) of these cubes with the 
property that for all s\ G D k , . . . , s n G D k we have (si, . . . , s n ) G 5. 

Denoting the event in question by J5, we denote by Ek the event that 
in addition, all points s\,...,s n of the directed path are the only Pois- 
son points in their respective cubes D k ,...,D k , of side length 2~ fc and 
such that (D k , . . .,D k ) G £>(<S). It is clear that E k ^ E and that P(-Efc) -> 
P{E), as A; — > oo. We therefore need to compute limfe_ +00 P{E^). Denoting 
the integrand in (5) by /, we write f(D k ,...,D k ) for the expectation of 
f(X\, . . . ,X n ) where the Xi's are independent and uniform over Di, respec- 
tively, i = 1, . . . ,n. 

Note that E^ occurs when there is an re-tuple (D k , . . . , D k ) € D(S) such 
that each box D k of side length 2~ k in this sequence contains exactly one 
Poisson point, such that in addition, the correct balls around these points 
contain no further Poisson point. This gives 

J2 (2- dk + o(2- dk )) n f(D k 1 ,...,D k n ) 



P(E k ) 



-dnk 



D*)6X>(5) 



,D k )+o(2 



-dnk\ 



(£>} 

f(x 1 ,...,x n )dxi 



■ dx r 



as k — > oo, proving (5). 

Now let be the polar cap of half angle 6 < j in the unit sphere of ' 
with vertex at the origin; that is, 

t>d . I, 



C(9) = {x = (xi,...,x d ) g: 



1 and x\ > cos#}. 



Since 
(6) 



it follows [using the notation of Proposition 2.2, letting Si = W\ + • • • + Wi 
and Bj = B(Sj-i, \Wj\)} that 



Pd(n,L) > 



dx-\ 



Pi |Wi|>--->|W„ 



>P\ |Wi|>--->|W n 



dx r 



3=1 
n 

J2 W 3 

3=1 



> L and Si £ Bj for each 



Wi 

> L and-—— G C(9) for each z 
\Wi\ 
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/ n \ 

> (b(9)) n P[ \Wi\ > ■ ■ ■ > \W n \,J2 \Wj\ cos(0) > L 

V j=i / 



(H0)T 



L 



COS I 



> 



(6(0))' 



??,! 



-P( |Wi| > 



L 



ncos(9) 



(b(9)) n Uri/(cose) d )n(L/n) d 



ill 



Here b(9) represents the probability that Si lies in a cone of half angle 
9 < f with vertex at Si_i. The result now follows from the approximation 

b d(9) n ^ e n(log(6(6>))-logn+l)_ □ 



To prove Proposition 2.2 we need Lemma 2.4 below which should be 
compared to (6). As a replacement for (5), one can straightforwardly obtain 
the inequality 



(7) 



p d (n,L,j) < 



Here = _B(sj, and Sj is the set of points (x\, . . . ,x n ) € (M ) n sat- 
isfying the three conditions (where again so = and Sj = x\ H \- Xi) : 

(i) |^i | > ■ • ■ > and < • • • < \x n \, 

(ii) \s n \ > L, 

(hi) Bif]{s ,.. . ,s n } = si-i for 1 = and J B[n{s , ...,s n } = si for Z 

i + l,...,n. 

Lemma 2.4. Let (x±, . . . ,x n ) 6 Sj. Then 
(3 n \ 

(8) 



,i=l 



v(U^ U B'A>± 

i=j+i ) 



j_ 

i=l 



J2V(Bi)+ J2 V{B>) 

i=j+i 



Proof. This is a consequence of the fact that no point x can lie in the 
intersection of more than K d of the balls -Bj. To see this, note that if this 
were not so, then x would be the closest point (among {x,qi, . . . ,q n }) to 
more than K d of the g^'s, contradicting the definition of the kissing number 
K A . □ 



PROOF of Proposition 2.2. From now on we call K d merely K. Start- 
ing with (7), we apply Lemma 2.4 to find 

p d (n,L,j)< I e-^EL^e-d/^E^^)^...^ 
Js 3 
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Js j 

f n 

= / f[e- {1/K)7rdlxj]d dxf-dx n . 

By the change of variable x, = y^K^ 1 /^ for each i, the last integral be- 
comes 

„ n 

(9) K n \e-^\ d d yi ---dy n , 

where <Sj is the set of (yi, . . . , y n ) € (R d ) n such that • • • , K^/% n ) € Sj. 

To get an upper bound on (9) we simply drop the third condition in the def- 
inition of Sj\ that is, we replace <Sj with S'- ', the set of (y±, . . . , y n ) satisfying 
only the two conditions: 

(i) 1 2/1 1 > •■• > \yj\ and \y j+1 \ < ■ ■ ■ < \y n \, 

(ii) \Y.UK^ Vl \>L. 

This yields the bound 

L \ 



K n i 

p d (n,L,j) < — — P\ 

j\{n-3)\ \ 



i=l 



> 



K n ( n L \ 

Proof of Proposition 2.3. From (3) and (4) we have 

oo n jy- n 

r d (L) <EE TIT^y P « + " • ■ + W»l > 

n=lj=0 J '" ■>>■ 



(10) 



E —hnwi + • • • + w n \ > C2 l) 

n=l 



oc 



< e 2 * £ e-^^P(|^| + • • • + \W n \ > c 2 L) 

n=0 



= e ZK P(U>c 2 L). 

The last equality follows by taking U to be a compound Poisson random 
variable \Wi \ + • • • + \ W^\ (where N is independent of \Wi\, \W 2 \, ■ ■ ■ and is 
Poisson with mean 2K). By a standard calculation, 

(11) E (e rU ) = e 2KE ^ WlUl \ 



8 I. KOZAKOVA, R. MEESTER AND S. NANDA 

This completes the proof of Proposition 2.3. □ 

PROOF of Theorem 1.1. Taking ra« n^rhr/d m (2) we get the lower 
bound in (1). 

For the upper bound we use large deviation bounds on P(U > C2L): 
P(U > c 2 L) < exp (inf {\og(E(e rU )) - rc 2 L}\ 

[_r>0 J 

(12) = exp j inf {2if£(e r|w/l1 -1) -rc 2 L} J 



< exp I inf {2K'{e crd/id 1] - rc 2 L)}}. 



The last inequality follows from the fact that E(e r \ Wl \) < e CT ' d/(d :> for 
large r (as can easily be shown). Taking r ~ a(\ogL)( d ~ 1 ^ d for an appropri- 
ate constant a we get t^{L) < e _c ' 2i ( lo s i ) d x) d for large L. □ 

Proof of Theorem 1.2. The lower bound follows from Proposition 
2.1 by taking L = 0. For the upper bound, we note that in order for a path 
with the required properties to exist, there has to be a path from the origin, 
touching at least [§J points with at most one change in direction. Using 
Proposition 2.2, this leads to 



\n/2\ 
Pd{n) < Pd 



j=0 



2 



,0,j 



\n/2\ R \n/2\ 



</ 



j!(ln/2j-l)! 
[n/2\\ 

e (n/2)log(21f d )-(n/2)(logn-log2-l) 



< e -c 2 nlogn 

for large n, proving the result. □ 



3. Proof of Theorem 1.3. We shall need the following simple fact. Define 
L d (a,b,y) = V(B(s,a)nB(t,b)), 
where s and t are two points in R d at distance y. 
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Proposition 3.1. For each fixed y>l we have 

Ld(l,y,y) 



0, 



as al — > oo . 



Proof. Note that Ld(l,y,y) is a lens contained in a cylinder of radius 
r = r(y) < 1 and height 1. The volume of this cylinder is itd-ir d = o(iVd). d 

Proof of Theorem 1.3. For notational convenience, we write Bij for 
B(si, \xj\), where as before Xj = Sj — Sj-i, for j = 1,2, 

The set of points (si,...,Sk) G (M a! ) fc satisfying |a?i| > |a?2 1 > • • • > |a?fc| is 
denoted by U^. Furthermore, the subset of which in addition satisfies 

i 

Si ^ U ^'rn—\,m 
m=l 

for i = 2,... ,j is denoted by Uj, for j = 2, . . . , k. For convenience we define 
U[ =Uk- As in Section 2 we may now write 



= / e -V(Bo,i) e -V , (Bi,a\Bo,i) e -V(Ba,s\(Bo,iUBi,a)) 

Js=(si,...,s k )eW k 

x ... x e~^ / '-' Bfc_1 ' feU ' Bfe ' fe ^Ui=i 1 Bi-i,!-) ^ s 

We now first estimate this from below, and after that we show that the 
error we make by doing this, tends to zero when the dimension tends to 
infinity. The first step is to replace the volumes in the exponents by the vol- 
ume we would get without subtracting anything from the first set mentioned 
in each exponent. Thus 

(13) g d (k)> f e -Kd\xx\ d e -*a\x2\ d ... e -* d \x*-i \ d e -^d\*k\ d ds. 

J s={sx,...,s k )eu' k 

Writing the integrand in this formula as W(s), the next step is to rewrite 
this as 




that is, we have an integral over as our leading term, and subtract from 
this the integral over those sequences s which have a first index j for which 
sj falls into a previous ball, j = 2, . . . , k. 
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To compute the leading term, note that a simple change of variables gives 
that 

(15) / W{s)ds= [ e~ yi ■■■e- yk - 1 e- 2yk dy k ---dy 1 . 

JU k J0<Vk<Vk-l<"<Vi 

This integral can be computed explicitly, but its value is most easily 
found and understood via a simple probabilistic interpretation. Indeed, the 
integral is equal to 1 /2 times the probability that Y\ > Y<l > ■ ■ ■ > 1*. , where 
the Yj's are independent, where Y%, Y2, • • • , ife-i are standard exponentially 
distributed and where Yf- has an exponential distribution with parameter 2. 
The probability that Y\,Y2, ■ ■ ■ are ordered this way is just l/(k — 1)!. 

Since has the same distribution as the minimum of two independent 
exponentially distributed random variables, the probability that one of these 
two will be the smallest among the k + 1 random variables in question is 
simply 2/(k + 1). It follows that the integral is equal to 

112 k 

2(jfe-l)! (fc + 1) ~ (fc + 1)!' 

Next we will show that the remaining terms in (14) tend to as the 
dimension d tends to infinity. For this we need to bound 

W(s)ds, 

for j = 2, . . . , k. For given j, this integral is over those sequences for which j 
is the first index so that Sj falls into a previous ball. Hence 

W{s)ds 

u>_ x \u> 

< j e —Xd\xi\ d j e -^d\x2\ d I e -7T d |x 3 _l| d 



f e -^\*A d dsydSx 

• I I' . Bi-t.i 



11 :.. ;' U.' , B 

Tdkl| d / „-1"dk'2| d / --K d \Xj-\_\ d 



J Sl eR d Js 2 eB 1 A Jsj-!eBj-2,j-2 

Yji = iV(Bj-ij-ir\Bi-ii) f _ 7T ,\ Xi \d 

v \ B i~i,i-V it. •.. . 

where in the last inequality we have used the fact that if is decreasing 

in 1 3; I, then for y with \y\ > 1 it is the case that 

/ /(l^^ gSf l AM)*- 

JxeB(0,l)nB(y,\y\) V{B(0,1)) Jx€B(0,l) 
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The volume in the numerator can be estimated by the volume of the largest 
possible intersection, which is the one with the first ball in the situation that 
Sj-i lies on its boundary: 

E£i V(Bj- ltj -i n Bi-i ti ) W/ . Ld(|sj -i|,|3;i|,|3;i|) 



< (J " 1) 
(J " 1] 



V(B(0,\ Xj ^\)) 



L d [l, 



X.i-1 



Xl 



Xj-l 



We now continue our estimate as follows, using the change of variables 
Hi = K d \xi\ d : 



[ W(s) 
JUi AW. 



ds 



< 



-^d\xi\° 



\xi\>0 



d7Td\xi 



|*2|<|a!i| 



X j - 1 \<\x j -2\ 



7Td 



Xl 



Xj-i 



Xl 



Xj-l 



Xj\<\xj-l\ 



- n ^ d d7T d \x j \ d ~ 1 d\x j \---d\x 2 \d\ 



Xl 



(j - 1) 

0<Vj<-<Vl 



L d [ 1 



Vi 
Vj-i 



l/d 



Vi 
Vj-i 



l/d 



< 



0<y,<-<2/i ^d 



e ^>i=i Vi dyj ■ ■ ■ dyi 
^L d (l,^^)e-^ dyj ...dy u 



Vj-i Vj-i- 

which tends to zero according to Proposition 3.1 and dominated convergence. 
It follows from all this that 

The final observation is the following: since the sum over k of k/{k + 1)! is 
equal to 1, the inequality is in fact an equality, and we conclude that 

k 



= {k + 1)V 



A; = 1,2,. 



proving the theorem. □ 
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